Abstract. The symbolic complexity of an infinite word $W$ is the function $p_W(l)$ counting the number of
different subwords in $W$ of length $l \in \mathbb{N}$. In this paper our main purpose is to study the complexity for a
class of topological dynamical systems, called iceberg systems, given by the following symbolic procedure.
Starting from a given finite word $w_1$ we construct a sequence of words

$w_{n+1} = w_n \, \rho_{\alpha_n(1)}(w_n) \cdots \rho_{\alpha_n(q_n-1)}(w_n)$,

where $\rho_\alpha(w_n)$ is a cyclic rotation of the word $w_n$, and consider an infinite word $w_\infty$ extending each $w_n$
to the right. It is shown that for iceberg systems given by the randomized parameters $\alpha_n(j)$ the complexity
function satisfies the estimate $p_{w_\infty}(l) > l^{3-\varepsilon}$ for any $\varepsilon > 0$, and at the same time it is observed that this
estimate represents, up to a small correction, the optimal lower bound for the complexity function, namely,
$p_{w_{n+1}}(l_n) < l_n^3$ along the subsequence $l_n = |w_n| + 1$.

1. Introduction

The symbolic complexity of an infinite word $W$ is the function $p_W(l)$ counting the total number of different
subwords in $W$ of length $l \in \mathbb{N}$. In a series of recent investigations the concept of symbolic complexity has been
applied to the study of orbit structure both in topological dynamics and in the ergodic theory of measure preserving
transformations (see, e.g., [2, 3, 4, 5, 9, 10]). Further, an invariant of measure-theoretic isomorphism
extending the concept of symbolic complexity was introduced by S. Ferenczi in [6]. Another general approach
to the complexity of orbit structure, introduced by A. Vershik, is based on the notion of scaling entropy [14].

Our investigation is motivated by the idea of applying the concept of complexity and coding arguments
to the problems of isomorphism in the ergodic theory of measure preserving transformations with zero entropy.
We start by recalling the classical symbolic construction of a rank one dynamical system.
Given a non-trivial finite word $v_1$ in a finite alphabet $A$, we define words

(1) $v_{n+1} = v_n 0^{s_{n,0}} v_n 0^{s_{n,1}} v_n \cdots 0^{s_{n,q_n-1}}$,

where the symbol "0" is used to create small spacers between the copies of the word $v_n$, and an infinite word
$v_\infty$ having each $v_n$ as a prefix. Following the standard scheme we can consider the compact set $K$ defined as
the weak closure of all translates of $v_\infty$ and endow $K$ with the standard measure $\mu$ which is invariant under
the shift transformation $T$. The triple $(T, K, \mu)$ is called a rank one transformation.

Rank one transformations serve as a classical example of dynamical systems with simple spectrum. In paper [12]
a class of ergodic maps is introduced which is similar to rank one systems and extends this class.
Namely, generalizing the rule (1), when defining $w_{n+1}$ we combine not only exact copies but arbitrary cyclic
rotations of the previous word $w_n$. It is observed that the measure preserving map associated with this new
symbolic procedure, called an iceberg transformation, always has 1/4-local rank and, under certain conditions,
simple spectrum. It is not obvious that the new class is actually larger than the class of rank one
transformations. At the same time, these two classes could potentially contain transformations having spectral
types with similar properties. Thus, it is interesting to find an invariant that helps to distinguish these classes.
In fact, it can be easily deduced from the results of this paper that the typical complexity of iceberg
systems is $p_{w_\infty}(l) > l^{3-\varepsilon}$. Hence a wide class of iceberg systems certainly goes beyond the class of rank
one systems, since for any rank one map there exists a finite partition of the phase space generating a
word $v_\infty$ whose complexity $p_{v_\infty}(l_n)$ is at most quadratic in $l_n$ along some sequence $l_n$ (see [7]), in
contradiction with the given lower bound. This proves the existence of iceberg systems which are not rank one.
However, this approach is essentially non-spectral; hence there are no obstacles to the existence of a pair
$(T_1, T_2)$ of spectrally isomorphic maps, where $T_1$ is rank one and $T_2$ is an iceberg map, but not rank one.
Thus, this class of symbolic systems, more general than rank one, can be considered as a source of new examples
demonstrating that rank is not a spectral invariant.

It is interesting to compare the lower bound for the symbolic complexity of iceberg systems with the
asymptotics of the complexity of the Pascal adic transformation, $p(l) \sim l^3/6$, established by X. Mela and
K. Petersen [10].

2. Iceberg dynamical systems 
We start with a series of constructions and definitions.

Definition. Consider a finite alphabet $A$ and a word $w = a_1 \ldots a_N$ in $A$. We use the notation $|w|$ for
the length of the word $w$. Given an integer $a$, $0 < a < |w|$, let us define the cyclic rotation $\rho_a(w)$ of
the word $w$ as follows:

$\rho_a(w) = w_2 w_1$, whenever $w = w_1 w_2$, $|w_1| = a$.

Observe that if $b \in A$ is a letter and $u$ is a word then $\rho_1(bu) = ub$ and $\rho_a = (\rho_1)^a$. Let us extend this
definition to all values of $a \in \mathbb{Z}$ by setting

$\rho_0(w) = w$ and $\rho_a(w) = \rho_{a + k|w|}(w)$, $k \in \mathbb{Z}$.
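
As an illustration, the cyclic rotation can be sketched in a few lines of Python. The function name `rotate` is ours and does not come from the original text; a word is modelled as a Python string, and the parameter is reduced modulo $|w|$, which mirrors the extension of the definition to all integers.

```python
def rotate(w: str, a: int) -> str:
    """Cyclic rotation rho_a(w): w = w1 w2 with |w1| = a is mapped to w2 w1."""
    if not w:
        return w  # the empty word has no non-trivial rotation
    a %= len(w)  # rho_a depends only on a modulo |w|
    return w[a:] + w[:a]


if __name__ == "__main__":
    print(rotate("abcde", 2))   # cdeab
    print(rotate("abcde", -1))  # eabcd
```

In this sketch `rotate(w, 1)` moves the first letter to the end, and applying it $a$ times gives `rotate(w, a)`.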

The next definition is the starting point of the construction of iceberg systems. We start by defining an
infinite word $w_\infty$, then we consider the closure $K_{w_\infty}$ of the sequence $\{w_\infty\}$ in the weak topology, and finally,
using the standard procedure, we endow the compact set $K_{w_\infty}$ with an invariant measure $\mu^{(w_\infty)}$ and arrive
at an ergodic transformation $T$ acting as the left shift on the space $(K_{w_\infty}, \mu^{(w_\infty)})$ (see [12]).

Definition. Let $w_1$ be a fixed finite word in $A$. We suppose that $w_1$ is non-trivial and contains a pair of
different letters. Let us construct a sequence of words $w_n$ such that each next word $w_{n+1}$ in the sequence
is a concatenation of cyclic rotations of $w_n$, namely,

(2) $w_{n+1} = \rho_{\alpha_n(0)}(w_n) \, \rho_{\alpha_n(1)}(w_n) \cdots \rho_{\alpha_n(q_n-1)}(w_n)$,

where $q_n > 2$ is the number of entries $\rho_{\alpha_n(j)}(w_n)$. Now suppose, for simplicity, that $\alpha_n(0) = 0$ for any $n$, so
that $w_{n+1}$ extends $w_n$ to the right, and after the infinite sequence of steps we come to an infinite word $w_\infty$
such that any $w_n$ is a prefix of $w_\infty$. Let $K_{w_\infty}$ be the closure of $\{w_\infty\}$ in the weak topology, i.e. the set of all
sequences $x = (x_j)_{j \in \mathbb{Z}}$ such that any finite subword of $x$ can be found as a subword of some $w_n$. The
compact set $K_{w_\infty}$ is invariant under the left shift map

(3) $T \colon (\ldots, x_0, x_1, \ldots, x_j, \ldots) \mapsto (\ldots, x_1, x_2, \ldots, x_{j+1}, \ldots)$.

We call $(T, K_{w_\infty})$ an iceberg topological dynamical system (without spacers).
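
The construction (2) can be sketched as follows. This is only an illustration under stated assumptions: the names `iceberg_step` and `iceberg_word` are hypothetical, the rotation parameters are drawn uniformly at random (the definition itself allows arbitrary parameters), and $\alpha_n(0) = 0$ is imposed so that each word is a prefix of the next one.

```python
import random


def rotate(w: str, a: int) -> str:
    """Cyclic rotation: move the first (a mod |w|) letters of w to the end."""
    a %= len(w)
    return w[a:] + w[:a]


def iceberg_step(w: str, q: int, rng: random.Random) -> str:
    """One step of rule (2): concatenate q rotations of w, the first one trivial."""
    # alpha(0) = 0 keeps w as a prefix; the other parameters are uniform on [0, |w|).
    alphas = [0] + [rng.randrange(len(w)) for _ in range(q - 1)]
    return "".join(rotate(w, a) for a in alphas)


def iceberg_word(w1: str, qs: list[int], seed: int = 0) -> str:
    """Apply iceberg_step once for every q in qs, starting from the word w1."""
    rng = random.Random(seed)
    w = w1
    for q in qs:
        w = iceberg_step(w, q, rng)
    return w


if __name__ == "__main__":
    w3 = iceberg_word("10", [3, 4], seed=1)
    print(len(w3), w3)  # the length is 2 * 3 * 4 = 24; the letters depend on the seed
```

Since a rotation only permutes letters, every word produced this way has the same fraction of each letter as the starting word.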

2.1. Symbolic complexity. Given an infinite word $w$ let us consider the language $\mathcal{L}(w)$ defined as the
set of all finite subwords of the word $w$. We can partition the set $\mathcal{L}(w)$ into a sequence of subsets $\mathcal{L}(w, l)$
according to the length of a subword,

$\mathcal{L}(w, l) = \{u \in \mathcal{L}(w) : |u| = l\}$.

The symbolic complexity $p_w(l)$ of the word $w$ is the function counting the number of different subwords in $w$ of
length $l$ or, in other terms, the function measuring the complexity of $\mathcal{L}(w)$,

$p_w(l) = \#\mathcal{L}(w, l)$,

where $\#A$ is the cardinality of a set $A$.
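
For a finite word the complexity function can be computed directly from the definition. The following minimal sketch (the name `complexity` is ours) counts distinct subwords of a given length; applied to a finite prefix of an infinite word it gives a lower bound for the complexity of the infinite word, since every subword of the prefix is a subword of the whole word.

```python
def complexity(w: str, l: int) -> int:
    """Number of distinct subwords of length l in the finite word w."""
    if l < 0:
        raise ValueError("the length must be non-negative")
    # The set comprehension collects each subword once, whatever its multiplicity.
    return len({w[i:i + l] for i in range(len(w) - l + 1)})


if __name__ == "__main__":
    print(complexity("0110", 2))  # 3: the subwords are 01, 11, 10
```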

Our paper is devoted to the proof of the following observation.

Theorem 1. Consider an iceberg system given by independent uniformly distributed parameters $\alpha_n(j)$,
and assume that $q_n \to \infty$ sufficiently fast, $q_n > h_n^\gamma$ with $\gamma > 1$, where $h_n = |w_n|$. Then almost surely

(4) $p_{w_\infty}(l) > l^{3-\varepsilon}$

for any $\varepsilon > 0$ and $l > l_0(\varepsilon)$.

2.2. Measure-theoretic point of view. Throughout this paper we will continuously compare purely
combinatorial observations with the corresponding effects in the context of measure-theoretic ergodic theory.
Infinite words are interpreted in this case as discrete sample paths of stationary random processes.

The concept of iceberg transformation can be extended if we endow the compact space $K_{w_\infty}$ with the
natural invariant measure $\mu = \mu^{(w_\infty)}$ generated by the set of empirical distributions $\mu_l$ defined as follows:
given a finite word $w \in \mathcal{L}(w_\infty, l)$, a word of length $l$, the value $\mu_l(w)$ is the asymptotic frequency of occurrence
of $w$ inside the infinite word $w_\infty$. This procedure leads to an ergodic dynamical system given by the left
shift map $T$ acting on the probability space $(K_{w_\infty}, \mathcal{B}, \mu^{(w_\infty)})$, where $\mathcal{B}$ is the sigma-algebra of Borel sets. It
is shown in paper [12] that an iceberg map can be obtained as a result of a cutting-and-stacking construction
in a way independent of the symbolic formalism we discuss (see fig. 1).

The main idea of our paper is to provide a combinatorial background for further study of the typical symbolic
complexity of ergodic iceberg systems with respect to the distribution $\mu^{(w_\infty)}$ (cf. [6] and [14]).

When studying our dynamical system as an ergodic transformation it is natural to ask whether the underlying
property of the iceberg construction can be formulated in an invariant form, as a kind of dynamical property
preserved under any measure-theoretic isomorphism.

Indeed, let us consider the classical concept of rank one approximation. 

Definition. We say that a measure preserving map $T$ on a probability space $(X, \mu)$ is of rank one if
for any measurable partition $\mathcal{P} = \{P_1, \ldots, P_m\}$ and any $\varepsilon > 0$ there exists a set $\Omega_\varepsilon$ of $\mu$-measure $1 - \varepsilon$ such
that for any $x \in \Omega_\varepsilon$ the orbit of $x$ represented in the form of an infinite word $(x_j)$, where $T^j x \in P_{x_j}$, satisfies
the following property: $(x_j)$ is $\varepsilon$-covered by disjoint words $W_j$ which are $\varepsilon$-close to some fixed word $W_{(\varepsilon)}$,

(5) $(x_j) = \cdots W_1 S_1 W_2 S_2 \cdots S_{j-1} W_j \cdots$,

where $W_j$ coincides with $W_{(\varepsilon)}$ up to at most $\varepsilon |W_{(\varepsilon)}|$ wrong letters.

Definition. Let $T$ be a measure preserving transformation. We say that $T$ admits iceberg approximation
if for any finite measurable partition $\mathcal{P}$ and any $\varepsilon > 0$ one can find a set $\Omega_\varepsilon$ of measure $1 - \varepsilon$ such that for
any point $x \in \Omega_\varepsilon$ the orbit $(x_j)$ of $x$ is $\varepsilon$-covered by words $W_j$ which are $\varepsilon$-close to cyclic rotations of a fixed
word $W_{(\varepsilon)}$,

(6) $(x_j) = \ldots W_1 S_1 W_2 S_2 \ldots S_{j-1} W_j \ldots$, $d(W_j, \rho_{\varphi_j}(W_{(\varepsilon)})) < \varepsilon$,

where $d(u, v)$ measures the fraction of non-matching letters in $u$ and $v$.

It is important to say that a priori we cannot skip spacers in this definition. Thus, generally speaking,
a common construction of an iceberg map should contain spacers as well. At the same time, it is
interesting to observe that, from the complexity point of view, adding spacers between cyclic rotations does
not significantly influence the asymptotics of $p_{w_\infty}$ (see the effect discussed below in remark 3).

3. Upper bound for symbolic complexity along a subsequence 

We start with the estimation of the symbolic complexity along the sequence of a special kind
$l_n = h_n + 1 = |w_n| + 1$. The idea of this calculation is to show that the lower bound discussed in the
next section is, in a sense, optimal.

Lemma 1. $p_{w_{n+1}}(h_n + 1) < h_n^3$ for any iceberg system.

Proof. Let us consider a subword $v$ of length $h_n + 1 = |w_n| + 1$ inside $w_{n+1}$, and consider $w_{n+1}$ as
a sequence of rotations $\rho_{\alpha_n(j)}(w_n)$. Suppose that $v$ starts at position $k$, so that $v$ includes the letters
at positions $k, \ldots, k + h_n$ and meets exactly two adjacent blocks: $\rho_{\alpha_n(j)}(w_n)$ and $\rho_{\alpha_n(j+1)}(w_n)$. It is
easy to see that the subword $v$ is uniquely determined by the value of the triple $(\bar{k}, \alpha_n(j), \alpha_n(j+1))$, where
$\bar{k} = k \ (\mathrm{mod}\ h_n + 1)$, $\bar{k} \ne 0$, and the total number of such triples is estimated as $h_n^3$. This idea is illustrated
in fig. 2. Let us summarize the estimates in the following line:

$p_{w_{n+1}}(h_n) < h_n^3 - h_n^2 + h_n$, $p_{w_{n+1}}(h_n + 1) < h_n^3$. □

Figure 1. Sample of an iceberg. A column is an ordinary Rokhlin tower corresponding to
a cyclic rotation of the word "afecde/". The transformation $T$ lifts each set drawn as a square
to the upper square. Whenever a point reaches the top set in a column, it is mapped by $T$
to an arbitrary bottom set of another (or the same) column.

Figure 2. Estimation of $p_{w_{n+1}}(l)$ for $l$ with $l = |w_n|$. The rectangle restricts a subword $v$
of length $l$. The vertical lines mark the boundaries of the blocks $\rho_{\alpha_n(j-1)}(w_n)$, $\rho_{\alpha_n(j)}(w_n)$,
$\rho_{\alpha_n(j+1)}(w_n)$, $\rho_{\alpha_n(j+2)}(w_n)$.
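
The counting argument in the proof of lemma 1 can be checked by brute force on a small example. The sketch below is an illustration only: the helper names are ours, the word "0010111" and the random rotations are arbitrary choices, and the check is that every subword of length $h + 1$ observed in a concatenation of rotations arises from some triple (offset, first rotation, second rotation), so that there are at most $h^3$ of them.

```python
import random
from itertools import product


def rotate(w: str, a: int) -> str:
    """Cyclic rotation: move the first (a mod |w|) letters of w to the end."""
    a %= len(w)
    return w[a:] + w[:a]


def candidate_subwords(w: str) -> set[str]:
    """All words of length |w| + 1 cut from rho_a(w) rho_b(w) at an offset k < |w|."""
    h = len(w)
    words = set()
    for k, a, b in product(range(h), repeat=3):  # h**3 triples in total
        pair = rotate(w, a) + rotate(w, b)
        words.add(pair[k:k + h + 1])
    return words


def observed_subwords(w_next: str, length: int) -> set[str]:
    """All distinct subwords of the given length that occur in w_next."""
    return {w_next[i:i + length] for i in range(len(w_next) - length + 1)}


if __name__ == "__main__":
    rng = random.Random(0)
    w = "0010111"
    h = len(w)
    alphas = [0] + [rng.randrange(h) for _ in range(49)]
    w_next = "".join(rotate(w, a) for a in alphas)
    seen = observed_subwords(w_next, h + 1)
    print(len(seen), "<=", len(candidate_subwords(w)), "<=", h ** 3)
```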

Now let us consider an iceberg system with spacers given by the scheme

$w_{n+1} = w_n 0^{s_{n,0}} \rho_{\alpha_n(1)}(w_n) 0^{s_{n,1}} \cdots 0^{s_{n,q_n-2}} \rho_{\alpha_n(q_n-1)}(w_n) 0^{s_{n,q_n-1}}$,

and observe that spacers do not affect the asymptotics along the sequence $h_n + 1$. We write $w_1 \preceq w$ if
$w_1$ is a subword of $w$.

Remark (Melting word effect). Consider a subword

$v = w_1 S w_2$, $w_1 \preceq \rho_{\varphi_1}(w_n)$, $w_2 \preceq \rho_{\varphi_2}(w_n)$,

of length $l_n = |w_n| + 1$ that covers a part $w_1$ of the block $\rho_{\alpha_n(j)}(w_n)$, a part $w_2$ of the block $\rho_{\alpha_n(j+1)}(w_n)$,
and a spacer $S = 0^s$. Let us imagine that the spacer $S$ is growing to the left and becomes $\nu$ symbols larger.
It can be observed that the growth of the spacer can be compensated by the opposite cyclic rotation in the



LOWER BOUNDS FOR SYMBOLIC COMPLEXITY OF ICEBERG DYNAMICAL SYSTEMS 






first block $\rho_{\alpha_n(j)}(w_n)$ by $-\nu$ positions. In other terms, the original word $v$ coincides up to $\nu$ letters with
a new word $\tilde{v}$ given by the configuration

$\tilde{v} = \tilde{w}_1 \tilde{S} w_2$, $\tilde{w}_1 \preceq \rho_{\varphi_1 - \nu}(w_n)$ and $\tilde{S} = 0^{s+\nu}$,

and a priori there is no reason to say whether $\tilde{v}$ coincides with $v$ or not. If we interpret the area covered by $v$
as a window through which we monitor the sequence $\rho_{\alpha_n(j)}(w_n)$, then it looks like the block $\rho_{\varphi_1}(w_n)$ is "melting"
from the right side: we erase $\nu$ letters in the subword $w_1$ while extending the spacer. Furthermore, from
the statistical point of view the words $v$ and $\tilde{v}$ are asymptotically $d$-close with $\mu$-probability $1 - \varepsilon_n$, where
$\varepsilon_n \to 0$, if spacers are supposed to be asymptotically small.

Definition. Let us consider the following order on finite words: $w_1 > w_2$ if $w_1$ differs from $w_2$ by replacing
some letters with the spacer symbol "0". We call the saturated complexity of an infinite word $w$ the function
$p_w(l)$ counting the total number of maximal points with respect to the order ">" in the set $\mathcal{L}(w, l)$ of
subwords in $w$ of length $l$.

Theorem 2. The saturated complexity satisfies $p_{w_{n+1}}(h_n + 1) < h_n^3$ for any iceberg system with spacers.

This theorem follows directly from the discussion in the previous remark.

4. Lower bounds for symbolic complexity 

Now we pass to the discussion of lower estimates for $p_{w_\infty}(l)$. The idea is to observe that generically, for
iceberg systems with sufficiently rich combinatorial structure, almost all configurations

$v = w_1 w_2 \preceq \rho_{\varphi_1}(w_n) \rho_{\varphi_2}(w_n)$, $w_1 \preceq \rho_{\varphi_1}(w_n)$, $w_2 \preceq \rho_{\varphi_2}(w_n)$,

and, respectively, almost all triples $(|w_1|, \varphi_1, \varphi_2)$ are observed inside the word $w_{n+1}$. The following lemma
is a simple combinatorial observation.

Lemma 2. Let us consider two large intervals on the integer line, $[k, k + m]$ and $[l, l + m]$, of the same
length $m$, and assume that $l - k \ne 0$. There exists a subset $A \subset [k, k + m]$ satisfying the conditions

$A \cap (l - k + A) = \varnothing$ and $\#A > \frac{1}{2} m (1 + o(1))$.

For the sequel it is convenient to find and fix a value $m_0$ such that $\#A > \frac{1}{3} m$ in the above lemma
whenever $m > m_0$.
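
One possible way to build such a set, not necessarily the construction the author has in mind, is to cut the interval into consecutive blocks of length $|l - k|$ and keep every second block. The sketch below assumes the condition of lemma 2 is read as "the set and its shift by $s = l - k$ are disjoint"; the name `shift_free_subset` is hypothetical.

```python
def shift_free_subset(k: int, m: int, s: int) -> list[int]:
    """Subset A of [k, k + m] such that A and s + A are disjoint, for a shift s != 0.

    The interval is cut into consecutive blocks of length |s| and every
    second block is kept, starting with the first one.
    """
    if s == 0:
        raise ValueError("the shift must be non-zero")
    step = abs(s)
    # Shifting by s moves a point to a neighbouring block, whose parity differs.
    return [x for x in range(k, k + m + 1) if ((x - k) // step) % 2 == 0]


if __name__ == "__main__":
    a = shift_free_subset(10, 100, 7)
    print(len(a), "elements out of", 101)
```

Every discarded block is preceded by a full kept block, so this choice keeps at least half of the points of the interval.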

Let us consider an iceberg system given by independent random cyclic rotation parameters $\alpha_n(j)$
uniformly distributed on the integer interval $[0 \,..\, h_n)$, where $h_n = |w_n|$. Let us use the notation $u \trianglelefteq w$ in the
case when $u$ is a subword of some cyclic rotation of $w$, i.e. $u \preceq \rho_\varphi(w)$ for some $\varphi$. This new class of dynamical
systems generalizes the class of randomized rank one dynamical systems introduced by D. Ornstein ([11],
see also [1]).

Definition. We say that our symbolic system satisfies property $\mathcal{D}_n(\beta)$ with $\beta > 0$ if any two subwords
$u \trianglelefteq w_n$ and $v \trianglelefteq w_n$ of the same length $m > \beta |w_n|$ are always identical whenever they are equal. Here we
use the word identical for the case when two subwords have the same starting position in $w$, i.e., in
fact, they correspond to one entrance of a subword. For example, two entrances of the word $u = v =$ cat
in a larger word $w =$ littlecatandbigcat, starting at position 6 and position 15, are equal but not identical.
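
The distinction between equal and identical subwords can be made concrete with a short sketch listing all entrances of a subword. The function name `occurrences` is ours; positions are counted from zero, which matches the positions quoted in the example.

```python
def occurrences(w: str, u: str) -> list[int]:
    """Starting positions (counted from 0) of all entrances of u in w."""
    return [i for i in range(len(w) - len(u) + 1) if w.startswith(u, i)]


if __name__ == "__main__":
    print(occurrences("littlecatandbigcat", "cat"))  # [6, 15]
```

Two entrances of a subword are identical exactly when they correspond to the same element of this list.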

In the sequel we will use the symbol $P(A)$ for the probability of the set $A$ according to the probability space
hosting the random parameters $\alpha_n(j)$. It is convenient to consider the two letter alphabet $A = \{0, 1\}$. Assume
that the initial word $w_1$ in the iceberg construction contains $\nu \cdot |w_1|$ ones and $(1 - \nu) \cdot |w_1|$ zeros. It follows easily
from the definition that any word $w_n$ contains exactly a $\nu$-fraction of ones. The following lemma mainly
concerns the $n$-th step of the construction, when we build the new word $w_{n+1}$ combining a sequence of $q_n$
random cyclic rotations of $w_n$.

Lemma 3. Consider a pair of independent cyclic rotations, for example,

$W_1 = \rho_{\alpha_n(j)}(w_n)$ and $W_2 = \rho_{\alpha_n(j+1)}(w_n)$,

and a pair of intervals $I_1, I_2 \subset \{0, 1, \ldots, |w_n| - 1\}$. Let $v_1 = W_1|_{I_1}$ and $v_2 = W_2|_{I_2}$ be the two subwords
corresponding to the subsets of indexes $I_1$ and $I_2$. Then the probability of matching $v_1 = v_2$ is less than
$1 - 2\nu + 2\nu^2$.

Proof. It is enough to estimate the conditional probability of matching $v_1 = v_2$ with respect to the finite
set of random parameters that influenced the structure of $w_n$. The probability of matching is less than the
probability of matching of letters at only one position in $I_1$. The uniform random rotation generates at any
position a random variable which is distributed as '0' with probability $1 - \nu$ and '1' with probability $\nu$.
Thus, the one-letter matching process is equivalent to the matching of two independent variables given by the
vector $\{1 - \nu, \nu\}$, and the probability that they coincide is $(1 - \nu)^2 + \nu^2 = 1 - 2\nu + 2\nu^2$. □
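
The one-letter matching probability used in lemma 3 can be verified exactly on a small word. The sketch below is illustrative: the function names are ours, exact rational arithmetic from the standard `fractions` module is used, and the brute-force variant simply enumerates all pairs of rotations.

```python
from collections import Counter
from fractions import Fraction


def rotate(w: str, a: int) -> str:
    """Cyclic rotation: move the first (a mod |w|) letters of w to the end."""
    a %= len(w)
    return w[a:] + w[:a]


def match_probability(w: str) -> Fraction:
    """Probability that two independent uniform rotations of w agree at one fixed pair of positions."""
    h = len(w)
    # Each letter is seen with its frequency, independently in the two rotations.
    return sum(Fraction(c, h) ** 2 for c in Counter(w).values())


def match_probability_brute_force(w: str, i: int, j: int) -> Fraction:
    """The same probability, obtained by enumerating all pairs of rotations (a, b)."""
    h = len(w)
    hits = sum(rotate(w, a)[i] == rotate(w, b)[j]
               for a in range(h) for b in range(h))
    return Fraction(hits, h * h)


if __name__ == "__main__":
    nu = Fraction(1, 4)  # the word 0001 has a quarter of ones
    print(match_probability("0001"), 1 - 2 * nu + 2 * nu ** 2)
```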

Remark that this lemma can be significantly strengthened, and our strategy in this paper is to avoid com- 
plicated investigation of random processes including large deviation mechanism and to use only elementary 
probability technique. 

Lemma 4. Assume that $\beta > \max\{36, m_0 + 1\} \cdot q_n^{-1}$. Then

$P(\mathcal{D}_{n+1}(\beta) \mid \mathcal{D}_n(1/2)) > 1 - q_n^3 \cdot h_n^{3 - \beta q_n/12}$.

Proof. Suppose that $\mathcal{D}_n(1/2)$ is satisfied. Consider two subwords $v_1$ and $v_2$ of $w_{n+1}$ (or of a cyclic
rotation of it; in what follows we omit remarks of this kind) starting at positions $k_1$ and $k_2$ respectively. Assume also
that

$|v_1| = |v_2| = M > \beta |w_{n+1}| = \beta q_n h_n$.

Both $v_1$ and $v_2$ are of the form

$v_i = v_i^{(\mathrm{head})} \, \rho_{\varphi_{i,1}}(w_n) \cdots \rho_{\varphi_{i,N_i}}(w_n) \, v_i^{(\mathrm{tail})}$.

In other words, $v_i$ starts and ends with two incomplete blocks enclosing a sequence of complete blocks
$\rho_{\varphi_{i,j}}(w_n)$. In order to understand whether $v_1$ and $v_2$ are matched, $v_1 = v_2$, let us write both $v_1$ and $v_2$ starting
at zero position. At this point we must consider two cases which are essentially different. The first case
is related to pairs of subwords which are very "close", that is, the first positions $y_1$ and $y_2$ of $v_1$ and $v_2$
satisfy the inequality

$|y_1 - y_2| < \frac{1}{2} |w_n|$.

The second case deals with the situation when, matching $v_1$ and $v_2$, we actually match pairs of
independently rotated blocks.

In the first case we can simply apply axiom $\mathcal{D}_n(1/2)$ to the situation when a word $w = \rho_\varphi(w_n)$ is matched
with itself shifted by a non-zero number of positions, since the size of the overlap is equal to or greater than

$|w_n| - |y_1 - y_2| > |w_n|/2$.

Concerning the second case, it can be easily seen that $v_1$ and $v_2$ contain a sequence of overlapping block
pairs

$(\rho_{\alpha_n(j)}(w_n), \rho_{\alpha_n(j+s)}(w_n))$, $j = j_0, \ldots, j_0 + m - 1$,

satisfying the following conditions:

$|\rho_{\alpha_n(j)}(w_n) \cap \rho_{\alpha_n(j+s)}(w_n)| > \frac{1}{2} |w_n|$, $m > \frac{1}{2} \frac{M}{h_n} - 1 > \frac{1}{4} \beta q_n$.

In these formulas we write $j + s$ for short instead of $j + s \ (\mathrm{mod}\ q_n)$, where $q_n$ is the total number of subblocks
$\rho_{\alpha_n(j)}(w_n)$ in the rotated word $w_{n+1}$. Now let us apply lemma 2 to the intervals $[j_0, j_0 + m - 1]$ and
$[j_0 + s, j_0 + s + m - 1]$ on $\mathbb{Z}$ and find a set $J$ such that $\#J > \frac{1}{3} m$ and $J \cap (s + J) = \varnothing$. Observe that
the pairs of random variables $(\alpha_n(j), \alpha_n(j + s))$ for $j \in J$ are globally independent, since all sets $\{j, j + s\}$,
$j \in J$, are disjoint. Now applying axiom $\mathcal{D}_n(1/2)$ we come to the following conclusion. The only possibility
for a pair of overlapping blocks $\rho_{\alpha_n(j)}(w_n)$ and $\rho_{\alpha_n(j+s)}(w_n)$ to match is the situation when
these two blocks are properly rotated and the word in the overlap enters identically in both blocks, and
the probability of such an event is exactly $h_n^{-1}$. Thus, counting all the pairs of subwords $v_1$ and $v_2$ under
testing, and using $h_{n+1} = q_n h_n$ and $\#J > \beta q_n / 12$, we have

$P(\mathcal{D}_{n+1}(\beta) \mid \mathcal{D}_n(1/2)) > 1 - h_{n+1}^3 \cdot h_n^{-\#J} > 1 - q_n^3 \cdot h_n^{3 - \beta q_n/12}$.

□

Suppose that some decreasing sequence $\beta_n \to 0$ is given. Applying the previous lemma we can easily find
a sequence $q_n$ such that for any $n$ the probability discussed in the lemma is positive, and we can take at
every step of the construction an appropriate configuration $(\alpha_n(0), \ldots, \alpha_n(q_n - 1))$ such that $\mathcal{D}_n(\beta_n)$ is true
for all $n$.

Note, however, that this reasoning still has a serious logical gap. In fact, to start this process we
need a starting word $w_1$ satisfying $\mathcal{D}_1(1/2)$ "a priori". We can achieve this by choosing a "never
matching" word like

10, 100, 1001,

but in view of further applications to the isomorphism problem we need a version of this lemma that can be
applied to any starting word $w_1$.

Lemma 5. Suppose that $\beta > \max\{36, m_0 + 1\} \cdot q_n^{-1}$ and assume that the initial word of the construction
contains a $\nu$-fraction of the symbol 1. Then

$P(\mathcal{D}_n(1/2)) > 1 - 2 q_n^3 \cdot \eta^{\beta q_n/12}$,

where $\eta = 1 - 2\nu + 2\nu^2$.

Proof. Let us examine once more the first case in the proof of lemma 4, taking into account that now we
cannot say anything about the matching of large blocks in $w_n$. An example of a word poorly amenable
to decoding is 1010101010...10.

Without loss of generality we can assume that in a pair of matched subwords $v_1$ and $v_2$ the second
subword starts just one position to the right of (the beginning of) $v_1$, in other words, $y_2 = y_1 + 1$. The
only information we have is that the last symbol of any full block $\rho_{\alpha_n(j)}(w_n)$ in $v_2$ is compared to the
first symbol of the next full block $\rho_{\alpha_n(j+1)}(w_n)$ included in $v_1$, and the probability of this event is less than
$\eta$ (see lemma 3). The additional summand to the probability of matching is estimated as

h 2 h -n* J <a 2 ■ 7,-^/12 

n n+l ll n '/ ^ q n 1 1 

Further, if $|y_2 - y_1| > |w_n|/2$ then the estimate is given by the value

h 3 h -n* J <a 3 ■ n - fjq " /12 

and the result follows. □ 

The idea of the forthcoming discussion is to examine, modulo axiom $\mathcal{D}_n(\beta_n)$, how many subwords $v$ of
length $h_n$ can appear asymptotically in the word $w_{n+1}$. The matching axioms $\mathcal{D}_n$ imply that counting subwords
is, in a sense, equivalent to counting configurations $(|w_1|, \varphi_1, \varphi_2)$ describing how $v$ covers two adjacent blocks
$\rho_{\alpha_n(j)}(w_n) \, \rho_{\alpha_n(j+1)}(w_n) \preceq w_{n+1}$. Let us consider triples $(\xi, \varphi_1, \varphi_2)$ indexing such configurations.
Recall that for any subword $v$ in $w_{n+1}$ of length $h_n$,

$v \preceq \rho_{\alpha_n(j)}(w_n) \, \rho_{\alpha_n(j+1)}(w_n)$.

Here $\varphi_1 = \alpha_n(j)$ and $\varphi_2 = \alpha_n(j + 1)$ are the corresponding cyclic rotation parameters of the adjacent blocks
covering $v$, and $\xi$ has the meaning of the point separating these blocks inside $v$. Notice that any triple defines
in a unique way a word $v$ that can occur as a subword of $w_{n+1}$. Let us denote this word as

$v = V(\xi, \varphi_1, \varphi_2)$.

Our purpose is to find a set of configurations such that the corresponding words $V(\xi, \varphi_1, \varphi_2)$ are different.

Lemma 6. Consider two triples $(\xi, \varphi_1, \varphi_2)$ and $(\eta, \psi_1, \psi_2)$ and suppose that

$\beta_n h_n < \xi < \eta < (1 - \beta_n) h_n$,
$|\xi - \eta| > \beta_n h_n$,

and axioms $\mathcal{D}_n(\beta_n)$ are satisfied. Then $V(\xi, \varphi_1, \varphi_2) = V(\eta, \psi_1, \psi_2)$ if and only if for any interval among
$[0, \xi)$, $[\xi, \eta)$ and $[\eta, h_n)$ the corresponding subwords are matched identically, in other words, if

$\eta - \xi = \psi_1 - \varphi_1 = \psi_2 - \varphi_2 = h_n + \psi_1 - \varphi_2$.

Observe that if the conditions of lemma 6 are true then evidently $V(\xi, \varphi_1, \varphi_2) \ne V(\eta, \psi_1, \psi_2)$, except in one
case: $\varphi_1 = \varphi_2$ and $\psi_1 = \psi_2$, which means that the adjacent blocks in the iceberg construction are cyclically
rotated by the same number of positions. In this case the boundaries given by the positions $\xi$ and $\eta$ are
transparent and cannot be recognized, since $\rho_\varphi(ww) = \rho_\varphi(w) \rho_\varphi(w)$.

Proof of lemma 6. The idea of this lemma is to require that all the intervals that appear while matching
the words generated by these two configurations be sufficiently long, so that we can apply property
$\mathcal{D}_n(\beta_n)$. In fact, for the intervals $[0, \xi)$, $[\xi, \eta)$ and $[\eta, h_n)$ we know that the corresponding subwords in the blocks
must be identically located subwords, and it is easy to express this conclusion in terms of the parameters $\xi$,
$\eta$, $(\varphi_1, \varphi_2)$ and $(\psi_1, \psi_2)$. □

Lemma 7. Suppose that all the triples $(\xi, \varphi_1, \varphi_2)$ are found for subwords in $w_{n+1}$ of length $h_n$. Then
counting configurations with $\varphi_1 \ne \varphi_2$ we have

$p_{w_{n+1}}(h_n) > \frac{1 - 4\beta_n}{\beta_n} \cdot (h_n^2 - h_n)$

if $\beta_n > h_n^{-1+\delta}$ and $0 < \delta < 1/2$.

Proof. Let us choose a progression $k, k + a, \ldots, k + (m - 1)a$ of length $m$, where

$k = a = [\beta_n h_n]$,

$m = \left[ \frac{(1 - 2\beta_n) h_n}{a} \right] > \frac{(1 - 2\beta_n) h_n}{\beta_n h_n + 1} - 1 > \frac{(1 - 2\beta_n)(1 - \beta_n^{(1-\delta)/\delta})}{\beta_n} - 1 > \frac{1 - 4\beta_n}{\beta_n}$,

since $(1 - \delta)/\delta > 1$ for $\delta < 1/2$. Any pair $(t_1, t_2)$ in the set of triples

$\Xi = \{(k + ja, \varphi_1, \varphi_2) : \varphi_1 \ne \varphi_2, \ 0 \le j < m\}$

satisfies the conditions of lemma 6, and in addition we can state that $V(t_1) \ne V(t_2)$, since $\varphi_1 \ne \varphi_2$. Finally,
let us count the triples in $\Xi$:

$\#\Xi = m \cdot (h_n^2 - h_n) > \frac{1 - 4\beta_n}{\beta_n} \cdot (h_n^2 - h_n)$.

□



Theorem 3. Suppose that $\varepsilon > 0$ is given, and $q_n$ satisfies the asymptotics $q_n \sim h_n^\gamma$ with $\gamma > 1 - \varepsilon$. Then
our iceberg system given by the sequence of i.i.d. random parameters $\alpha_n(j)$ with $|w_{n+1}| = q_n |w_n|$ almost
surely generates an infinite word $w_\infty$ such that

$p_{w_\infty}(h_n) > p_{w_{n+1}}(h_n) > h_n^{3-\varepsilon}$

for $n > n_0$.

Proof. Let us take $\beta_n = h_n^{-1+\delta}$ with $\max\{1 - \gamma, 0\} < \delta < \varepsilon$. In order to apply lemma 5 and establish
property $\mathcal{D}_n(\beta_n)$ let us look at the probability in lemma 5:

$P(\mathcal{D}_n(1/2)) > 1 - 2 q_n^3 \cdot \eta^{\beta_n q_n/12} = 1 - \varepsilon_n$,

where $\beta_n q_n = h_n^{-1+\delta+\gamma}$ and $-1 + \delta + \gamma > 0$. Since the series $\sum_n \varepsilon_n$ converges, we can apply the Borel-Cantelli
lemma and see that almost surely there exists $n_1$ such that $\mathcal{D}_n(\beta_n)$ holds for $n > n_1$. The symbolic complexity is
now estimated as follows:

$p_{w_\infty}(h_n) > p_{w_{n+1}}(h_n) > \mathrm{const} \cdot \frac{h_n^2}{\beta_n} = \mathrm{const} \cdot h_n^{3-\delta} > h_n^{3-\varepsilon}$,

whenever $n > n_0 > n_1$. □
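
The arithmetic behind the last estimate can be illustrated numerically. The sketch below assumes that the bound of lemma 7 is read as $(1 - 4\beta)/\beta \cdot (h^2 - h)$ with $\beta = h^{-1+\delta}$; the function names are ours. It computes the exponent $e$ for which the bound equals $h^e$, which approaches $3 - \delta$ from below as $h$ grows.

```python
import math


def lemma7_bound(h: float, delta: float) -> float:
    """Value of (1 - 4*beta) / beta * (h**2 - h) for beta = h**(-1 + delta)."""
    beta = h ** (-1.0 + delta)
    return (1.0 - 4.0 * beta) / beta * (h * h - h)


def effective_exponent(h: float, delta: float) -> float:
    """Exponent e such that the bound equals h**e."""
    return math.log(lemma7_bound(h, delta)) / math.log(h)


if __name__ == "__main__":
    for h in (1e3, 1e6, 1e9):
        print(h, effective_exponent(h, delta=0.1))
```

For any fixed $\varepsilon > \delta$ the exponent eventually exceeds $3 - \varepsilon$, which is the content of the final inequality in the proof.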

This theorem can be strengthened as follows to get symbolic complexity arbitrarily close to cubic.

Theorem 4. Consider a sequence $q_n \sim h_n^\gamma$, $\gamma > 1$, defining a set of random iceberg systems given by
independent and uniformly distributed random parameters $\alpha_n(j)$. Then almost surely the symbolic
complexity for this iceberg system satisfies the inequality

$p_{w_\infty}(h_n) > h_n^{3-\varepsilon}$, $n > n_0(\varepsilon)$.

The only difference from the proof of the previous theorem is that we have to fix some maximal value $\varepsilon_0$
and a universal sequence $\beta_n = h_n^{-1+\delta_n}$ such that $\delta_n \to 0$ sufficiently slowly.

It was observed by S. Ferenczi [5] that the lowest rate of growth for the complexity function concerning
rank one systems can be seen along the subsequence $h_n$. Thus, this case is considered the most difficult
if we are interested in the estimate from below (see also lemma 1). Now, the above theorems can be easily
extended to all lengths of subwords $l \in \mathbb{N}$. The complexity outside the sequence $h_n$ becomes even larger
(cf. [5]), since we have more freedom when combining a word from several rotated copies of $w_n$. Then we come
to the following result.

Theorem 5. Given a random iceberg system with $q_n \sim h_n^\gamma$, $\gamma > 1$, the infinite word generated by the
system with probability one possesses an almost cubic estimate for the complexity,

$p_{w_\infty}(l) > l^{3-\varepsilon}$, $n > n_0(\varepsilon)$,

where the function $n_0(\varepsilon)$ depends on the system.

5. Application to the calculation of rank 

Using the effects discussed in section 4 we can show that certain iceberg systems are not rank one. We 
give only an idea of proof. 

Theorem 6. Under the conditions of theorem 5, almost surely the iceberg system is not rank one.

Note that the method we use is not applicable to the multiple rank property, for example, rank two.

Sketch of the proof. Suppose that our transformation $T$ is rank one. Then there exists a partition
$\mathcal{P} = \{B_0, B_1\}$ and an orbit $T^k x_0$ such that the complexity of the word $(x_k)$ is quadratic along a
sequence $h_i$ (see [5]),

p (Xk) m < \hi 

The most difficult point in the proof is to manage infinite words which are generated by an arbitrary finite
partition $\mathcal{P}$. In other words, the following argument is used to show that the complexity of $(x_k)$ is $l^{3-\varepsilon}$,
providing a contradiction. Having small $\zeta$, erasing a $\zeta$-fraction of letters the following estimate remains true:

$p_{w_{n+1}}(l) > l^{3-\delta}$.

Indeed, our infinite word is covered up to a small error $\zeta_n$ by cyclic rotations of $w_n$, and asymptotically
it is easy to see from the approximation that, passing through the iceberg with index $n$, the names of all points (but
a $\zeta_n$-part) are $\zeta_n$-close.

Applying these arguments we must take into account that $\zeta_n$ is small but it can go to zero arbitrarily
slowly. □

6. Scaling approximation and Pascal adic transformation

Let us consider a tree associated with the Pascal triangle given by the vertex set $V = \{(k, n) : 0 \le k \le n\}$,
and the set of admissible paths in this graph:

$X = \{(y_n, n) : y_{n+1} = y_n \text{ or } y_{n+1} = y_n + 1\}$.

Next, let us consider an order on $X$ induced by the natural order on the vertices of the same level: $(k, n) <
(k + 1, n)$. The Pascal adic transformation $T$ is the transformation on $X$ that maps a path $x$ to the smallest
path greater than $x$ (up to a small negligible set). The Pascal map $T$ becomes an ergodic measure preserving
transformation if we endow $X$ with the natural measure describing the statistics of paths (see [13]).

We mention the Pascal transformation in this paper for two reasons. First, it belongs to the class of measure
preserving maps which contains iceberg systems as well and is based on the concept of scaling approximation
introduced below in this section. The second reason is the following theorem establishing the exact cubic
asymptotics for the symbolic complexity of $T$ (see [10]).

Theorem 7. (X. Mela, K. Petersen) The symbolic complexity of the Pascal adic transformation $T$ satisfies

$p(l) \sim \frac{1}{6} l^3$.

Definition. We say that a measure preserving invertible map satisfies the property of scaling
approximation of rank one with a scaling function $\lambda(h)$ (or simply scaling rank one) if for any $\varepsilon > 0$ almost every
orbit encoded with a finite measurable partition can be $\varepsilon$-covered by subwords $u_j$ of a fixed word $W_{(\varepsilon)}$, $|W_{(\varepsilon)}| = h$,
such that the average length of $u_j$ is greater than $\lambda(h) (1 + o(1))$.

The property of funny scaling approximation of rank one, by analogy with the funny rank one property,
is defined in the same manner but taking, instead of subwords, arbitrary restrictions $v_j = W_{(\varepsilon)}|_{I(j)}$ of the
words $W_{(\varepsilon)}$ considered as functions $W_{(\varepsilon)} \colon [0 \,..\, |W_{(\varepsilon)}| - 1] \to A$.




Figure 3. The collection of towers in the cutting-and-stacking procedure for the Pascal adic map,
drawn in such a way that all the elementary sets sharing the
same level are always labeled with the same letter. Here the 9th step of the construction
is shown; the vertical columns are the Rokhlin towers of height $C(n, k)$, where $n = 9$ and
$k = 0, 1, \ldots, n$, and the common height of this generalized tower is equal to $h_9 = 2^8 = 256$.
Red lines and blue lines correspond to the two symbols of the alphabet. Note: you can zoom
this picture and see in detail any part of the coding sequence $x_0, x_1, \ldots, x_{255}$.

Theorem 8. Any rank one (respectively rank $m$) transformation has the property of scaling approximation
of rank one (rank $m$) with $\lambda(h) = 1$.

Theorem 9. Any iceberg map is of scaling rank one with $\lambda(h) = 1/2$.






This theorem follows from the simple observation that any cyclic rotation of the word $w_n$ is a kind
of interval exchange involving two parts, one of length $\alpha |w_n|$ and another of length $(1 - \alpha) |w_n|$, which are
compensated: $\frac{1}{2}(\alpha + (1 - \alpha)) = \frac{1}{2}$.

Observation 1. It can be easily seen that if $\lambda(h) > \mathrm{const} > 0$ then the map has positive local rank
(see the scheme of the proof in [12]).

Observation 2. The Pascal adic transformation admits scaling approximation of rank one with the scaling
function

(7) $\lambda(h) = \frac{1}{\sqrt{\pi \log_2 h}}$.

The picture on figure 3 is the result of reconstruction of the usual cutting-and-stacking representation 
of the Pascal map, when the towers of height C(n,k) are lifted so that the sets on the same level of the 
generalized tower are always marked by the same symbol. Here C(n, k) are the binomial coefficients. 

Question 1. What is the precise symbolic complexity of the iceberg system with random rotations? 

Question 2 (iceberg systems with multiple IET). Let us consider a class of systems given by the more 
general construction: instead of cyclic rotation (which is in fact a kind of discrete two interval exchange 
map) we apply a random r- interval exchange transformation to the words w n (see [12]). What is the typical 
asymptotics for the symbolic complexity for this ensemble of symbolic systems? 

Question 3. Is it true that the scaling function given by (7) is optimal for the Pascal adic transformation? 

Acknowledgements. The author is very grateful to M. Lemanczyk and S. Ferenczi for the discussions
concerning the subject of this work, largely inspired by papers [5] and [8].

I would like to thank A. Stepin, A.Vershik, B. Gurevich, V. Oseledec, S.Pirogov, J.-P. Thouvenot, 
K. Petersen, V. Ryzhikov, El H. El Abdalaoui and the participants of seminar "Ergodic theory and sta- 
tistical physics" at Moscow State University and "St. Petersburg seminar on representation theory and 
dynamical systems" for the fruitful discussions and attention to this work. 



References

[1] El H. El Abdalaoui, F. Parreau, and A. A. Prikhod'ko. A new class of Ornstein transformations with singular spectrum.
Annales de l'Institut Henri Poincaré (B) Probability and Statistics, 42(6):671-681, 2006.

[2] P. Arnoux, C. Mauduit, I. Shiokawa, and J.-I. Tamura. Complexity of sequences defined by billiards in the cube. Bull.
Soc. Math. France, 122:1-12, 1994.

[3] S. Brlek. Enumeration of factors in the Thue-Morse word. Discr. Appl. Math., 24:83-96, 1989.

[4] A. de Luca and S. Varricchio. Some combinatorial properties of the Thue-Morse sequence. Theor. Comput. Sci., 63:333-348,
1989.

[5] S. Ferenczi. Rank and symbolic complexity. Ergod. Th. and Dynam. Sys., 16:663-682, 1996.

[6] S. Ferenczi. Measure-theoretic complexity of ergodic systems. Israel J. Math., 100:189-207, 1997.

[7] S. Ferenczi. Systems of finite rank. Colloq. Math., 73(1):35-65, 1997.

[8] S. Ferenczi and M. Lemanczyk. Rank is not a spectral invariant. Studia Math., 98(3):227-230, 1991.

[9] S. Ferenczi and L.Q. Zamboni. Languages of k-interval exchange transformations. Bull. Lond. Math. Soc., 40(4):705-714,
2008.

[10] X. Mela and K. Petersen. Dynamical properties of the Pascal adic transformation. arxiv: 0310311, 2003.

[11] D.S. Ornstein. On the root problem in ergodic theory. Proc. 6th Berkeley Sympos. Math. Statist. Probab., Univ. Calif.,
2:347-356, 1970.

[12] A. A. Prikhod'ko. On ergodic properties of "iceberg" transformations, I: approximation and spectral multiplicity. Preprint,
arxiv:1002.2808v1.

[13] A.M. Vershik. The Pascal automorphism has a continuous spectrum. Funct. Anal. Appl., 45(3):173-186, 2011.

[14] A.M. Vershik. Scaling entropy and automorphisms with purely point spectrum. Algebra i Analiz, 23(1):111-135, 2011.



Dept. of Mechanics and Mathematics, Moscow State University 
E-mail address: sasha.prihodko@gmail.com



